Scalable critical-path analysis and optimization guidance for hybrid MPI-CUDA applications
Related resources
MPI- and CUDA- implementations of modal finite difference method for P-SV wave propagation modeling
Among discretization approaches, the Finite Difference Method (FDM) is widely used for acoustic and elastic full-waveform modeling. An inevitable drawback of the technique, however, is its severe demand for computational resources. A promising solution is parallelization, where the problem is broken into several segments and the calculations are distributed over different processors. ...
Near–Critical Path Analysis: A Tool for Parallel Program Optimization
Program activity graphs (PAGs) can be constructed from timestamped traces of appropriate execution events. Information about the activities on the k longest execution paths is useful in the analysis of parallel program performance. In this paper, four algorithms for finding the near–critical paths of PAGs are compared, including a best–first search (BFS) algorithm that is worst–case asymptotica...
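As a rough illustration of the idea in this abstract, the sketch below finds the single longest (critical) path through a small activity graph. It is not one of the four algorithms the paper compares, and it does not enumerate the k near-critical paths. The function name `critical_path`, the edge format `(source, target, duration)`, and the sample graph are all assumptions made for this example.

```python
from collections import defaultdict, deque


def critical_path(edges):
    """Return (length, path) of the longest path in a weighted DAG.

    `edges` is an iterable of (source, target, duration) tuples; this
    format is an assumption of the sketch, not taken from the paper.
    """
    successors = defaultdict(list)
    indegree = defaultdict(int)
    nodes = set()
    for src, dst, duration in edges:
        successors[src].append((dst, duration))
        indegree[dst] += 1
        nodes.update((src, dst))
    if not nodes:
        return 0.0, []

    # Longest distance from any source node, filled in topological order.
    dist = {n: 0.0 for n in nodes}
    pred = {n: None for n in nodes}
    queue = deque(sorted(n for n in nodes if indegree[n] == 0))
    visited = 0
    while queue:
        u = queue.popleft()
        visited += 1
        for v, duration in successors[u]:
            if dist[u] + duration > dist[v]:
                dist[v] = dist[u] + duration
                pred[v] = u
            indegree[v] -= 1
            if indegree[v] == 0:
                queue.append(v)
    if visited != len(nodes):
        raise ValueError("activity graph contains a cycle")

    # Walk predecessor links back from the node that finishes last.
    end = max(nodes, key=lambda n: dist[n])
    path = []
    node = end
while_node = None  # placeholder removed below
```

```python
# Hypothetical two-process trace: "a*" and "b*" are activities on two
# processes, and the a1 -> b2 edge stands for a message dependency.
```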
An MPI-CUDA Implementation and Optimization for Parallel Sparse Equations and Least Squares (LSQR)
LSQR (Sparse Equations and Least Squares) is a widely used Krylov subspace method to solve large-scale linear systems in seismic tomography. This paper presents a parallel MPI-CUDA implementation of the LSQR solver. At the CUDA level, our contributions include: (1) utilize CUBLAS and CUSPARSE to compute major steps in LSQR; (2) optimize memory copy between host memory and device memory; (3) develop a ...
Runtime affinity optimization for hybrid MPI + OpenMP solvers
In the physical sciences, it is not uncommon to encounter parallel numerical codes in which computation is partitioned between solution of multiple fundamentally different problems, related in a pipelined manner through their state variables. Such a multi-phase scheme might prove advantageous by reducing the “spin up” and “wind down” time that would otherwise be incurred when using separate cod...
Workflow Timed Critical Path Optimization
Approaches to shorten workflow execution time have been discussed in many areas of computer engineering, such as parallel and distributed systems, computer circuits, and PERT charts for project management. To optimize the structure of a workflow model, an approach with corresponding algorithms is proposed to cut the timed critical path of a workflow schema, which has the longest average execution ti...
Journal
Journal title: The International Journal of High Performance Computing Applications
Year: 2016
ISSN: 1094-3420,1741-2846
DOI: 10.1177/1094342016661865